region harbors a gene associated 



WHAT IS CLAIMED IS : 

^^A^method of confirming that a candidate genor 
with a detectable trait comprising the steps of: 

constructing a candidate region distribution of test values using a plurality of biallelic 
markers in a candidate genomic region suspected of harboring said gene associated with said 
detectable trait, said candidate region distribution of test values being indicative of the 
difference in the frequencies of said plurality of biallelic markers in said candidate region in 
individuals who possess said detectable trait and/control individuals who do not possess said 
detectable trait; 

constructing a random region distribi|£ion of test values using a plurality of biallelic 
markers in random genomic regions which are not suspected of harboring said gene associated 
with said detectable trait, said random region distribution of test values being indicative of the 
difference in the frequencies of said plura/lity of biallelic markers in said random genomic 
regions in individuals who pps&ess said detectable trait and control individuals who do not 
possess said detectable trail 

determining whetheriaid candidate region distribution of test values and said random 
region distjftmtion of test values areysignificantly different from one another. 

The method of Cla/m 1, wherein said step of constructing a candidate region 
distribution of test values comprises performing a haplotype analysis on each possible 
combination of biallelic markersAn each group in a series of groups of biallelic markers in said 
candidate region, calculating test values for each possible combination, and including the test 
value for the haplotype which has the greatest association with said trait in said candidate region 
distribution of test values for each group in said series of groups of biallelic markers in said 
candidate genomic region aim wherein said step of constructing a random region distribution of 
test values comprises perfomiing a haplotype analysis on each possible combination of biallelic 
markers in each group in af series of groups of biallelic markers in said random genomic regions, 
calculating test values foneach possible combination, and including the test value for the 
haplotype which/nas the/greatest association with said trait in said random region distribution of 
test values fo^each gropp in said series of groups of biallelic markers in said random genomic 
regions. 

The jSiethod of Claim 2, wherein said steps of performing a haplotype analysis 
on eaofi possible combination of biallelic markers in each group in said series of groups of 
biallelic markers irj said candidate genomic region and calculating said test values for each 
combination comprises the steps of: 
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calculating the frequencies for each combination of biallelic markers in each group in 
said series of groups of biallelic markers in said c/ndidate genomic region in individuals 
expressing said detectable trait; 

calculating the frequencies for each combination of biallelic markers in each group in 
said series of groups of biallelic markers in s^d candidate genomic region in individuals who 
do not express said detectable trait; and 

comparing the haplotype frequencies in individuals who express said trait and 
individuals who do not express said trait py performing a chi-squared analysis to yield said test 
values. / 

A./ The method of Claim 3', wherein said steps of performing a haplotype analysis 
on each^possible combination of biallelic markers in each group in said series of groups of 
biallelic markers in said random genomic regions and calculating said test values for each 
combination comprises the steps on 

calculating the frequencies for /each combination of biallelic markers in each group in 
said series of groups of biall/flic iparjtfers in said random genomic regions in individuals 
expressing said detectable i 

calculating the frequencies for each combination of biallelic markers in each group in 
said series of groups of biallelic markers in said random genomic regions in individuals in 
individuals who do not express said detectable trait; and 

comparing the haplorype frequencies in individuals who express said trait and 
individuals who do not exprpss said trait by performing a chi-squared analysis to yield said test 
values. / 

A. The methda of Claim 4, wherein said step of comparing said candidate region 
distribution of test values /o said random region distribution of test values comprises performing 
a Wilcoxop rank test. 

A. i The metfiod of Claim 4, wherein said step of comparing said candidate region 
distribution/of test values to said random region distribution of test values comprises performing 
a Kolmpgorov-Smirnoy test. 

The r^ethod of Claim 4, said step of comparing said candidate region 
distribution^ test values to said random region distribution of test values comprises performing 
both a Wjfcoxon rank test and a Kolmogorov-Smirnov test. 

<B. The method of Claim 4, wherein each of said groups of biallelic markers in said 
seri^/of groups of oiallelic markers in said candidate genomic region and each of said groups of 
biallelic markers in said series of groups of biallelic markers in said random genomic regions 
comprises 3 biallelic markers. 
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JtfT The method of Claim 4, wherein each of said groups of biallelic markers in said 
series of groups of biallelic markers in said candidate genomic region and each of said groups of 
biallelic markers in said series of groups of bialleljc markers in said random genomic regions 
comprises aUeast 3 biallelic markers. 

IQl The method of Claim 4, whereiA said biallelic markers in each of said groups in 
said series of groups of biallelic markers in said candidate genomic region have an average 
intermarker distance selected from the group consisting of one marker every 3kb, one marker 
every 5kb/one marker every lOkb, one marker every 20kb, and one marker every 30kb. 

1. The method of Claim 10, wHerein said biallelic markers in each of said groups 
in said series of groups of biallelic markers in said random genomic regions have an average 
intermarker distance selected from the groin) consisting of one marker every 3kb, one marker 
every 5kb,eme marker every lOkb, one marker every 20kb, and one marker every 30kb. 

1/2. The method of Claim 4 fprther comprising selecting random genomic regions 
for use in s^fid haplotype analysis which iiave at least 3 biallelic markers therein. 

1/3. The method of CleinL 12, further comprising selecting random genomic regions 
for use in said haplotype analysis in wmch said biallelic markers have an average intermarker 
distance sufficient conductingsa^pldtype analysis. 

14: The method of Claim 13 further comprising selecting random genomic regions 
for use in said haplotype analysis Wherein said at least 3 biallelic markers are in Hardy- 
Weinberg equilibrium in individuals expressing said detectable trait and control individuals who 
do not express said detectable trait. 

15. The method of Claim 14 further comprising selecting random genomic regions 
for use in said haplotype analy/is in which said at least 3 biallelic markers are not in complete 
linkage disequilibrium to be useful in conducting a haplotype analysis. 

r6. The method/of Claim 3 further comprising selecting biallelic markers in said 
candidate genomic region which are in Hardy- Weinberg equilibrium in individuals expressing 
said detectable trait and control individuals who do not express said detectable trait for use in 
said haplotype analysis. 

A 7. / The mefhod of Claim 16 further comprising determining the total number of 
markers unsaid candidate genomic region. 

LB. The method of Claim 4 further comprising the step of verifying that the 
bialleli(^markers in ^aid random genomic regions are appropriate for use in the haplotype 
analysis by: 
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randomly dividing said biallelic markers m said random genomic regions into a first 
verification group and a second verification group, wherein said first verification group and said 
second verification group contain a substantially identical number of biallelic markers; 

constructing a first verification distribution of test values for the biallelic markers in 
said first verification group by performing a Haplotype analysis on each possible combination of 
biallelic markers in each group in a series oy groups of biallelic markers in said first verification 
group, calculating test values for each possible combination, and including the test value for the 
haplotype which has the greatest association with said trait in said first verification distribution 
of test values for each group in said series of groups of biallelic markers in said first verification 
group; / 

constructing a second verification distribution of test values for the biallelic markers in 
said second verification group by performing a haplotype analysis on each possible combination 
of biallelic markers in each group in a series of groups of biallelic markers in said second 
verification group, calculating test values for each possible combination, and including the test 
value for the haplotype which has the greatest association with said trait in said second 
verification distribution of test values for each group in said series of groups of biallelic markers 
in said second verification group;/ 

determining whether saia first verification distribution and said second verification 
distribution are significantly different from one another, wherein said biallelic markers in said 
random genomic regions zvt \ appropriate for use in the haplotype analysis if said first 
verification distribution and sa&seefSnd verification distribution are not significantly different 
from one apwher. / 

The method Li Claim 1 8 wherein said steps of performing a haplotype analysis 
on eaai possible combination of biallelic markers in each group in said series of groups of 
biallelic markers in said firsjt and second verification groups and calculating said test values for 
each combination comprises the steps of: 

calculating the frrauencies for each combination of biallelic markers in said first 
verification group in eacn group in said series of groups of biallelic markers in individuals 
expressing said detectable trait; 

calculating the frequencies for each combination of biallelic markers in said first 
verification group in each group in said series of groups of biallelic markers in individuals who 
do not express said detectable trait; 

comparing tme haplotype frequencies of said biallelic markers in said first verification 
group in individuals who express said trait and individuals who do not express said trait by 
performing a chi-sauared analysis to yield said test values; 
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calculating the frequencies for each combination of biallelic markers in said second 
verification group in each group in said series of groups of biallelic markers in individuals 
expressing said detectable trait; / 

calculating the frequencies for each combination of biallelic markers in said second 
verification group in each group in said seriep of groups of biallelic markers in individuals who 
do not express said detectable trait; / 

comparing the haplotype frequencies of said biallelic markers in said second 
verification group in individuals who express said trait and individuals who do not express said 
trait by performing a chi-squared analysis to yield said test values 

20. The method of Claim w, wherein said step of determining whether said first 
verification distribution and said second verification distribution are significantly different from 
one another comprises performing a Wilcoxon rank test on said first and second verification 
distributions. / 

'2 1 . The method of Claim 1 9, wherein said step of determining whether said first 
verification distribution ana said Wcond verification distribution are significantly different from 
one another comprises rerfprtnmgfa Kolmogorov-Smirnov test on said first and second 
verification distributions. / 

(22. The method of Claim 19, wherein said step of determining whether said first 
verification distribution and said second verification distribution are significantly different from 
one another comprises performing a both a Kolmogorov-Smirnov test and a Wilcoxon rank test 
on said firar and second verification distributions. 

/23. The method jof Claim 19, wherein each of said groups of biallelic markers in 
said series of groups of biallelic markers in said first verification group and each of said groups 
of biallelic markers in said series of groups of biallelic markers in said second verification 
group contjuns 3 biallelic markers. 

>24. The metnod of Claim 19, wherein each of said groups of biallelic markers in 
said series of groups of Biallelic markers in said first verification group and each of said groups 
of biallelic markers in said series of groups of biallelic markers in said second verification 
group contains more man 3 biallelic markers. 

/25y The method of Claim 1, wherein said method is performed by a computer. 
to. The/method of Claim 25, wherein said computer provides an output indicative 
of whether ^aid candidate region distribution of test values and said random region distribution 
of test values are Significantly different. 

21. The method of Claim 26 further comprising further evaluating said candidate 
genonjic region to identify candidate genes which might be associated with said detectable trait 



if said output indicates that said candidate region distribution of test values and said random 
region distribution of test values are significantly different. 

2y The method of Claim 1 further comprising further evaluating said candidate 
genomic region to identify candidate genes which might be associated with said detectable trait 
if said candidate region distribution of testyvalues and said random region distribution of test 
values are significantly different. 

The method of Claim 4^ wherein the frequencies for each combination of 
biallelic markers in each group in saidr series of groups of biallelic markers in said candidate 
genomic region and in said rmdqm genomic regions in individuals expressing said detectable 
trait are calculated using t^Stpe^ation Maximization algorithm; and 

the frequencies for each Combination of biallelic markers in each group in said series of 
groups of biallelic markers in said candidate genomic region and said random genomic regions 
in individuals who do not express said detectable trait are calculated using the Expectation 
Maximization algorithm. 

yo. A method of determining whether a candidate genomic region harbors a gene 
associated with a detectable trait comprising determining whether the association of a plurality 
of biallelic markers located in said candidate genomic region with said detectable trait is 
significantly different taan the association of a plurality of biallelic markers located in a 
plurality of random genomic regions. 




31. 



The method 



of Claim 30 wherein the determination of whether the association 



of said plurality of biallelic i larkers located in said candidate genomic region with said 
detectable trait is significant y different than the association of said plurality of biallelic markers 
located in a plurality of rand Dm genomic regions comprises: 

constructing a candidate region distribution of test values using said biallelic markers in 
said candidate genomic region, said candidate region distribution of test values being indicative 
of the difference in the haplptype frequencies of said biallelic markers in said candidate region 
in individuals who possess \ aid detectable trait and control individuals who do not possess said 
detectable trait; 

constructing a rancjbm region distribution of test values using said biallelic markers in 
said genomic region said random region distribution of test values being indicative of the 
difference in the haplotype frequencies of said biallelic markers in said random genomic regions 
in individuals who posses s said detectable trait and control individuals who do not possess said 
detectable trait; and 

comparing said c andidate region distribution of test values with said random region 
distribution of test values . 
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*ik. The method of Claim 3 1 , wherein said step of constructing a candidate 
region distribution of test values composes performing a haplotype analysis on each possible 
combination of biallelic markers in eafch group in a series of groups of biallelic markers in said 
candidate region, calculating test valups for each possible combination, and including the test 
value for the haplotype which has the greatest association with said trait in said candidate region 
distribution of test values for each group in said series of groups of biallelic markers in said 
candidate genomic region and when in said step of constructing a random region distribution of 
test values comprises performing a lLaplotype analysis on each possible combination of biallelic 
markers in each group in a series of groups of biallelic markers in said random genomic regions, 
calculating test values for each possible combination, and including the test value for the 
haplotype which has the greatest as ;ociation with said trait in said random region distribution of 
test values £or each group in said sejries of groups of biallelic markers in said random genomic 
regions. 

33. A computer systeiA for confirming that a candidate genomic region harbors a 
gene associated with a detect^Ie^ait, wherein the computer system comprises instructions that 
when executed perform the/ 

constructing a candidate region distribution of test values using a plurality of biallelic 
markers in a candidate genomic re ;ion suspected of harboring said gene associated with said 
detectable trait, said candidate reg on distribution of test values being indicative of the 
difference in the frequencies of sa d plurality of biallelic markers in said candidate region in 
individuals who possess said dete ;table trait and control individuals who do not possess said 
detectable trait; 

constructing a random re jion distribution of test values using a plurality of biallelic 
markers in random genomic regions, said random region distribution of test values being 
indicative of the difference in the frequencies of said plurality of biallelic markers in said 
random genomic regions in indn iduals who possess said detectable trait and control individuals 

le trait; and 

\ candidate region distribution of test values and said random 
region distribution of test valued are significantly different from one another. 

3y£ The computer system of Claim 33, wherein said instructions for constructing a 
candidate region distribution of test values comprise instructions for performing a haplotype 
analysis on each possible comb nation of biallelic markers in each group in a series of groups of 
biallelic markers in said candidate region, calculating test values for each possible combination, 



who do not possess said detectat 
determining whether sai 



and including the test value for 



the haplotype which has the greatest association with said trait 



in said candidate region distribi tion of test values for each group in said series of groups of 
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biallelic markers in said candidate genomid region and wherein said instructions for 
constructing a random region distribution 6f test values comprise instructions for performing a 
haplotype analysis on each possible combination of biallelic markers in each group in a series of 
groups of biallelic markers in said random genomic regions, calculating test values for each 
possible combination, and including thef test value for the haplotype which has the greatest 
association with said trait in said randdm region distribution of test values for each group in said 
series of groups of biallelic markers in said random genomic regions. 

p. The computer system of Claim 34, wherein said instructions for performing a 
haplotype analysis on each possible Combination of biallelic markers in each group in said 
series of groups of biallelic markers fin said candidate genomic region and calculating said test 
values for each combination comprise instructions for: 

calculating the frequencies /for each combination of biallelic markers in each group in 
said series of groups of biallelic markers in said candidate genomic region in individuals 
expressing said detectable trait; / 

calculating the frequencies for each combination of biallelic markers in each group in 
said series of groups of bipelicrodrkers in said candidate genomic region in individuals who 
do not express said detectable trait; and 

comparing the haplotype frequencies in individuals who express said trait and 
individuals who do not expres^ said trait by performing a chi-squared analysis to yield said test 
values. 

36. The compute! system of Claim 35, wherein said instructions for performing a 
haplotype analysis on each possible combination of biallelic markers in each group in said 
series of groups of biallelic markers in said random genomic regions and calculating said test 
values for each combination comprise instructions for: 

calculating the frecniencies for each combination of biallelic markers in each group in 
said series of groups of biallelic markers in said random genomic regions in individuals 
expressing said detectable trait; 

calculating the frequencies for each combination of biallelic markers in each group in 
said series of groups of mallelic markers in said random genomic regions in individuals in 
individuals who do not express said detectable trait; and 

comparing the haplotype frequencies in individuals who express said trait and 
individuals who do not? express said trait by performing a chi-squared analysis to yield said test 
values. 
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jyf. The computer system of Clailn 36, wherein said instructions for comparing said 
candidate region distribution of test values to said random region distribution of test values 
comprise instructions for performing a Wilcoxon rank test. 

yX. The computer system of Qlaim 36, wherein said instructions for comparing said 
candidate region distribution of test valurfs to said random region distribution of test values 
comprise inductions for performing a Kolmogorov-Smimov test. 

The computer system If Claim 36, wherein said instructions for comparing said 
candidate region distribution of test values to said random region distribution of test values 
comprise instructions for performing/both a Wilcoxon rank test and a Kolmogorov-Smirnov 
test. / 

40. A programmed stoijbge device comprising instructions that when executed 
perform the steps of: 

constructing a candidate region distribution of test values using a plurality of biallelic 
markers in a candidate genomic region suspected of harboring said gene associated with said 
detectable trait, said traiyassociMed^istribution of test values being indicative of the difference 
in the frequencies of saiiptoriltty of biallelic markers in said candidate region in individuals 
who possess said detectable tra/t and control individuals who do not possess said detectable 
trait; 

constructing a randoni region distribution of test values using a plurality of biallelic 
markers in random genomic regions, said random region distribution of test values being 
indicative of the difference in the frequencies of said plurality of biallelic markers in said 
random genomic regions in individuals who possess said detectable trait and control individuals 
who do not possess said detectable trait; and 

determining whetner said candidate region distribution of test values and said random 
region distribution of test Values are significantly different from one another. 

The programmed storage device of Claim 40, wherein said instructions for 
constn/cting a candidate distribution of test values comprise instructions for performing a 
haplotype analysis on each possible combination of biallelic markers in each group in a series of 
groups of biallelic markers in said candidate region, calculating test values for each possible 
combination, and including the test value for the haplotype which has the greatest association 
with said trait in said candidate region distribution of test values for each group in said series of 
groups of biallelic markers in said candidate genomic region and wherein said instructions for 
constructing a random region distribution of test values comprise instructions for performing a 
haplotype analysis o i each possible combination of biallelic markers in each group in a series of 
groups of biallelic n arkers in said random genomic regions, calculating test values for each 
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possible combination, and including the test value for the haplotype which has the greatest 
association with said trait in said random region distribution of test values for each group in said 
series of grp^ips of biallelic markers in said random genomic regions. 

A2. The programmed storage device of Claim 41, wherein said instructions for 
performing a haplotype analysis on each possible combination of biallelic markers in each 
group in said series of groups of biallelic markers in said candidate genomic region and 
calculating said test values fpr each combination comprise instructions for: 

calculating the frea|uencies for each combination of biallelic markers in each group in 
said series of groups of biallelic markers in said candidate genomic region in individuals 
expressing said detectable trait; 

calculating the frequencies for each combination of biallelic markers in each group in 
said series of groups of Biallelic markers in said candidate genomic region in individuals who 
do not express said detectable trait; and 

companng tfteJiaplotype frequencies in individuals who express said trait and 
individuals yftio fo ijot|e^ress said trait by performing a chi-squared analysis to yield said test 
values. 

/43. The programmed storage device of Claim 42, wherein said instructions for 
perf9fming a haplotyp s analysis on each possible combination of biallelic markers in each 
group in said series of groups of biallelic markers in said random genomic regions and 
calculating said test v ilues for each combination comprise instructions for: 



calculating th[e 
said series of groups 



frequencies for each combination of biallelic markers in each group in 
)f biallelic markers in said random genomic regions in individuals 



expressing said detec able trait; 



calculating ft 
said series of groups 
individuals who, do 



e frequencies for each combination of biallelic markers in each group in 
1 pf biallelic markers in said random genomic regions in individuals in 
3t express said detectable trait; and 
comparing t&e haplotype frequencies in individuals who express said trait and 
individuals vpo do ipt express said trait by performing a chi-squared analysis to yield said test 
values. 

Thd programmed storage device of Claim 43, wherein said instructions for 
comparing sajcl candidate region distribution of test values to said random region distribution of 
test values /omprisp instructions for performing a Wilcoxon rank test. 

: programmed storage device of Claim 43, wherein said instructions for 
comparing said candidate region distribution of test values to said random region distribution of 
test values comprise instructions for performing a Kolmogorov-Smirnov test. 
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f6. The programmed storage device of Claim 43, wherein said instructions for 
comparing said candidate region distribution of test values to said random region distribution of 
test values comprise instructions for performing both a Wilcoxon rank test and a Kolmogorov- 
Smirnov^test. 

47. Thfe'p&grammed storage device of Claim 40, wherein said programmed storage 
device is selected frrfm the group consisting of a hard disk, a floppy disk, Random Access 
Memory, Read Onfy Memory and Electrically Eraseable Programable Read Only Memory. 
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